Negation and Speculation Identification in Chinese Language

نویسندگان

  • Bowei Zou
  • Qiaoming Zhu
  • Guodong Zhou
چکیده

Identifying negative or speculative narrative fragments from fact is crucial for natural language processing (NLP) applications. Previous studies on negation and speculation identification in Chinese language suffers much from two problems: corpus scarcity and the bottleneck in fundamental Chinese information processing. To resolve these problems, this paper constructs a Chinese corpus which consists of three sub-corpora from different resources. In order to detect the negative and speculative cues, a sequence labeling model is proposed. Moreover, a bilingual cue expansion method is proposed to increase the coverage in cue detection. In addition, this paper presents a new syntactic structure-based framework to identify the linguistic scope of a cue, instead of the traditional chunking-based framework. Experimental results justify the usefulness of our Chinese corpus and the appropriateness of our syntactic structure-based framework which obtained significant improvement over the stateof-the-art on negation and speculation identification in Chinese language. *

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Negation and Speculation Target Identification

Negation and speculation are common in natural language text. Many applications, such as biomedical text mining and clinical information extraction, seek to distinguish positive/factual objects from negative/speculative ones (i.e., to determine what is negated or speculated) in biomedical texts. This paper proposes a novel task, called negation and speculation target identification, to identify...

متن کامل

Linguistic scope-based and biological event-based speculation and negation annotations in the Genia Event and BioScope corpora

Background: The treatment of negation and hedging in natural language processing has received much interest recently, especially in the biomedical domain. However, open access corpora annotated for negation and/or speculation are hardly available for training and testing applications, and even if they are, they sometimes follow different design principles. In this paper, the annotation principl...

متن کامل

A Unified Framework for Scope Learning via Simplified Shallow Semantic Parsing

s 94.99 94.35 94.67 Papers 90.48 87.47 88.95 Negation cue recognition Clinical 86.81 88.54 87.67 Abstracts 83.74 93.14 88.19 Papers 73.02 82.31 77.39 Speculation cue recognition Clinical 33.33 91.77 48.90s 83.74 93.14 88.19 Papers 73.02 82.31 77.39 Speculation cue recognition Clinical 33.33 91.77 48.90 Table 9: Performance of automatic cue recognition with automatic parse trees on the three sub...

متن کامل

Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora

BACKGROUND The treatment of negation and hedging in natural language processing has received much interest recently, especially in the biomedical domain. However, open access corpora annotated for negation and/or speculation are hardly available for training and testing applications, and even if they are, they sometimes follow different design principles. In this paper, the annotation principle...

متن کامل

Detecting Negated and Uncertain Information in Biomedical and Review Texts

The thesis proposed here intends to assist Natural Language Processing tasks through the negation and speculation detection. We are focusing on the biomedical and review domain in which it has been proven that the treatment of these language forms helps to improve the performance of the main task. In the biomedical domain, the existence of a corpus annotated for negation, speculation and their ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015